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ABSTRACT 

We stacked the X-ray data from the ROSAT All Sky Survey for over 4,000 
clusters selected from the 2MASS catalog and divided into five richness classes. 
We detected excess X-ray emission over background at the center of the stacked 
images in all five richness bins. The interrelationships between the mass, X-ray 
temperature and X-ray luminosity of the stacked clusters agree well with those 
derived from catalogs of X-ray clusters. Poisson variance in the number of galax- 
ies occupying halos of a given mass leads to significant differences between the 
average richness at fixed mass and the average mass at fixed richness that we 
can model relatively easily using a simple model of the halo occupation distribu- 
tion. These statistical effects probably explain recent results in which optically- 
selected clusters lie on the same X-ray luminosity-temperature relations as local 
clusters but have lower optical richnesses than observed for local clusters with 
the same X-ray properties. When we further binned the clusters by redshift, 
we did not find significant redshift-dependent biases in the sense that the X-ray 
luminosities for massive clusters of fixed optical richness show little dependence 
on redshift beyond that expected from the effects of Poisson fluctuations. Our 
results demonstrate that stacking of RASS data from optically selected clusters 
can be a powerful test for biases in cluster selection algorithms. 

Subject headings: X-rays: galaxies: clusters 

1. Introduction 

The primary objective of many new extragalactic surveys is to determine the equation 
of state of the dark energy. One approach is to determine the evolution of the cluster 
mass function with redshift, which depends on dark energy through the growth factor and 
the volume element (e.g., Haiman, Mohr & Holder 2001; Huterer & Turner 2001; Podariu 
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& Ratra 2001; Kneissl et al. 2001; Newman et al. 2002; Levine, Schulz, & White 2002; 
Majumdar & Mohr 2003; Hu 2003). Clusters are identified using some observable proxy for 
their mass over a broad range of redshifts and the proxy must then be calibrated to compare 
the results to the theoretically predicted mass functions. Possible proxies and identification 
methods are the optical richness of the cluster (e.g., Abell 1958; Huchra & Geller 1982; Dalton 
et al. 1992; Postman et al. 1996, 2002; Zaritsky et al. 1997; Gonzalez et al. 2001; Bahcall et 
al. 2003; Gal et al. 2003; Gladders & Yee 2005), the X-ray luminosity or temperature (e.g., 
Gioia et al. 1990; Henry et al. 1992; Rosati et al. 1995,1998; Ebeling et al. 1998, 2000; 
Bohringer et al. 2000, 2004; Bauer et al. 2002; Giacconi et al. 2002), the Sunyaev-Zeldovich 
decrement (e.g., Carlstrom et al. 2000; LaRoque et al. 2003) and the weak lensing shear 
(e.g., Wittman et al. 2001; Dahle et al. 2002; Schirmer et al. 2003). Examples of large scale 
surveys planning on using these methods are the Dark Energy Survey (DE£§ using optical 
richness, Sunyaev-Zeldovich and weak lensing), the Large Synoptic Survey Telescope (LSST@ 
optical richness and weak lensing) and the Supernova / Acceleration Probe (SNAF0 optical 
richness and weak lensing). This is in addition to smaller scale surveys based on optical 
richness such as the SDSS cluster surveys (Goto et al. 2002; Bahcall et al. 2003) and the 
Red Cluster Sequence Survey (RCS, Gladders & Yee 2005). 

A persistent worry about optically selected cluster samples is that chance projections of 
foreground and background galaxies significantly bias the resulting catalogs. The problem 
can range from false positives, detections of non-existent clusters, to a richness bias in which 
chance projections lead to overestimates of the cluster richness. The problems should be 
more severe for higher redshift and lower richness clusters because the effects of a chance 
projection increase as the number of detectable galaxies in the cluster diminishes. One 
approach to checking the extent to which a cluster finding algorithm is affected by these 
issues is to generate mock galaxy catalogs matching the actual data as closely as possible, 
find clusters in the mock catalogs and then compare the output and input catalogs (e.g., 
Kochanek et al. 2003; Miller et al. 2005). In general these comparisons have found only 
modest biases. A second approach is to compare the mass estimates based on the optical 
richness to those from another method. At present this has meant examining either the 
X-ray properties (e.g., Bahcall 1977; Donahue et al. 2002; Kochanek et al. 2003; Popesso et 
al. 2004) or the weak lensing masses (e.g., SDSS/RCS) of the optically-selected clusters. 

A basic problem of most existing tests of optically-selected cluster catalogs using X-ray 
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data is that the comparisons are made to catalogs of X-ray clusters rather than through 
X-ray observations of the optically-selected clusters. While this provides a simple means 
of calibrating the relationship between optical and X-ray properties, it is not an optimal 
approach to searching for biases in the optical selection methods. The problems could be 
more severe because several recent studies (e.g., Bower et al. 1997; Lubin, Mulchaey & 
Postman 2004; Gilbank et al. 2004) have found offsets between the X-ray temperature- 
optical richness relations of local and higher redshift optically-selected clusters even though 
the clusters lie on the same X-ray temperature-luminosity relations. 

In this paper we address these questions by measuring the mean X-ray properties of 
a sample of galaxy clusters selected from the 2MASS survey (Kochanek et al. 2003, 2006 
in preparation) by averaging ("stacking") the ROSAT All-Sky Survey (RASS, Voges et 
al. 1999) data for the clusters as a function of richness and redshift. We outline the cluster 
catalog and our procedures for analyzing the RASS data in §2. In §3 we discuss the theoretical 
differences between measuring the mean properties of clusters at fixed richness rather than 
fixed mass. In §4 we present our results for the correlations between the optical richness, 
X-ray properties and cluster mass estimates, as well as the dependence of the results on 
cluster redshift. We summarize our results in §5. We assume that H = 70 km s _1 Mpc -1 , 
Q m = 0.3, and = 0.7 throughout the paper. 

2. The Cluster Catalog and The X-ray Data 

The cluster catalog we use was selected from the 2MASS infrared survey (Skrutskie et 
al. 2006) using the matched filter algorithm of Kochanek et al. (2003) applied to the 380,000 
galaxies with K< 13.25 mag (2MASS 20 mag/arcsec 2 circular, isophotal magnitudes) and 
Galactic latitude |6| > 5°. For the purposes of this paper we are interested only in the 
richness iV* 666 and the redshift z of the cluster. The richness iV* 666 is defined to be the 
number of L > galaxies inside the (spherical) radius where the galaxy over density 
is An = 200Qjj ~ 666 times the mean density based on the galaxy luminosity function 
of Kochanek et al. (2001). This radius would match the radius with a mass over density 
of Am = 200 for Qm = 0.3 and a bias factor of unity. Clusters were included in the final 
catalog if they had likelihoods InL > 10 and if the cluster redshift was below that at which 
the cluster would contain one galaxy inside r^ 666 at the magnitude limit of the surveyQ These 
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Fig. 1. — RASS images of four clusters over-plotted with probable cluster members (squares, 
> 50% likelihood of membership as determined from the cluster finding algorithm in 
Kochanek et al. 2003) and field galaxies (X symbols, < 50% likelihood) along with the 
error ellipse for the cluster centroids provided by optical cluster detection algorithm. Panel 
A shows the typical cluster, where the optical position is roughly consistent with the X-ray 
position. Panel B shows an example where the existence of a cluster is correctly inferred in 
the optical catalog, but the position is seriously in error. Panels C and D show two examples 
where the optical cluster is confused by the presence of multiple (or merging) clusters. 
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Fig. 2. — The stacked ROSAT images of 2MASS clusters in richness bins 0-4 and at random 
positions. The images are 2.0, 2.0, 1.5, 1.0, 1.0, and 1.0 /i^Mpc in radius for richness bins 0-4 
and the random positions. The contours are drawn at levels of 1-4 a above the background. 
Excess X-ray emission is clearly detected in all richness bins, whereas the stacked image 
using random positions does not show any significant emission at the center. 
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selection limits were determined by examining how cluster likelihoods changed as a function 
of the limiting magnitude of the input galaxy catalog. We divided the cluster catalog into 
five richness bins, iV* 666 > 10, 10 > iV* 666 > 3, 3 > iV*666 > 1 an d 1 > N* 666 > 0.3, which 
we will refer to as richness bins (richest) to 4 (poorest) respectively. There are 191, 862, 
1283, 1298 and 699 clusters in the bins. We matched our cluster catalog to the Abell cluster 
catalog and found most matches are from our richness bins 0, 1, and 2. 

We use the RASS to determine the average properties of the optically-selected clusters, 
since it is the only recent X-ray survey with the necessary sky coverage. Pointed observations 
with ROSAT, Chandra, or XMM-Newton are not useful because they would include only a 
small fraction of the objects in our catalog and represent a biased sampling of the catalog. 
The RASS data are fairly uniform in both integration time and the average point spread 
function because the data were obtained by scanning the satellite over the sky. The RASS 
data and the exposure maps, arranged in 6.4 x 6.4 square degree fields, were obtained through 
the High Energy Astrophysics Science Archive Research Center (HEASARC0). We extracted 
the X-ray data in fixed physical regions around the clusters of 2.0, 2.0, 1.5, 1.0, and 1.0 
hj^Mpc radius for richness bins 0-4. These scales were chosen to be large enough to estimate 
the mean X-ray backgrounds since the background must be subtracted for our analyses of the 
X-ray images and spectra. We ignored the 17% of the clusters for which the extraction regions 
extended outside the edges of the standard RASS images. The typical exposure time for each 
cluster is ~ 400 sec, and we dropped the ~ 1% of clusters with exposure times exceeding 
1500 sec (mostly near the poles of the RASS survey) so that they would not dominate 
the signal-to-noise ratio of the average. Since we are testing the optical catalog, dropping 
clusters because of the characteristics of the X-ray survey should have no consequences for 
our results. We did not distinguish between the different observing stripes of the RASS, 
which have different observing times and backgrounds, instead simply using the average 
exposure time and background of the combined data. These differences were usually small 
(less than 10%) and should have little effect on our analysis. Some examples of the optical 
and X-ray clusters are shown in Figure [TJ 

We must also remove contaminating sources from each cluster field. We removed the 
bright point sources in the RASS bright source catalog (Voges et al. 1999) and replaced them 
with the average of the regions surrounding them (annuli with inner and out radii of 1.5 and 
2.5 times the extent of the point sources). As for the bright extended sources detected 
in the RASS bright source catalog, in most cases they are the clusters we need to stack, 
although we identified and excluded two clusters with emission from supernova remnants 
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in our extraction region. Finally, approximately 4% of the clusters had additional optical 
clusters in the extraction annulus. In these cases, we only analyzed the richest cluster. In 
the cases (five in total) where two clusters are of comparable richness, we excluded both 
of them as the optical positions tend to be poorly estimated, and sometimes, the optical 
detection algorithm only found one cluster, locating the optical centroid between the two 
X-ray clusters. One such example is shown in Figure [TH. After applying all these filters, we 
were left with 157, 670, 1004, 1057, and 497 clusters for richness bins 0-4, respectively. 



2.1. Surface Brightness Profiles 

We produce the stacked images by rescaling the data for each cluster to a common 
distance of 100 Mpc. In addition to adjusting the photon positions, we weight each photon 
by the square of the ratio between the cluster distance and 100 Mpc. We binned the X-ray 
images with pixel sizes of 100, 100, 75, 50, and 50 /i^kpc for richness bins 0-4 and then 
smoothed them with a Gaussian with a dispersion of one pixel. We clearly detect X-ray 
emission from the clusters in all five richness bins, as shown in Figure [2j The significance 
contours in Figure [2] apply to the individual pixels, so the sources as a whole are detected 
at much higher confidence levels than indicated by the contours. For example, although the 
peak of the emission from the poorest richness bin is only 2a above the background, the 
emission as a whole is detected at about 7a. We also used bootstrap re-sampling to test the 
significance of the X-ray emission of the poorest richness bin. First, we tested whether the 
emission was dominated by a small subset of the objects by randomly drawing identically 
sized samples from the original sample with replacement and then analyzing the simulated 
sample. In each of the 100 trials there was excess X-ray emission detected at over 3a, and in 
94 out of 100 trials the cluster was detected at over 5a. If, on the other hand, we stack data 
from an equal number of random positions with |6| > 5° and excluding our standard regions 
near the 2MASS cluster positions, only 8 out of 100 trials have 3a detections and no trials 
are detected at over 4a. One such image is shown in Figure [2J Based on these simulations, 
we estimate that we detect the richness 4 clusters at greater than the 99% confidence level. 
In the outer, background regions of the images, the different richness bins have consistent 
background flux levels and when azimuthally smoothed they show the flat profile expected 
for convergence. 

The background-subtracted surface brightness profiles for each richness bin are shown in 
Figure [31 where we have used radial bins of constant area, normalized the surface brightness 
profile by the optical richness iV* 666 and scaled the results to a luminosity distance of 100 Mpc. 
We subtracted the background for each cluster before we rescaled the data. Like the stacked 
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Fig. 3. — The stacked X-ray surface brightness profiles as a function of richness and for 
random positions. All clusters were background subtracted and normalized to a common 
distance of 100 Mpc and by their iV* 666 values. The dashed lines indicate the level of the 
subtracted background and the dash-dotted lines are the best fit (3 models for each bin. 
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images, the surface brightness profiles clearly show excess emission above the background 
for all richness bins, while the profile from stacking random positions does not. We fit 
the profiles with a standard (3 model, S(R) = S (l + R 2 / ' R 2 )~^ +l l 2 obtaining the results 
presented in Table [TJ We obtain similar results whether we fit the background-subtracted 
profiles directly or include an additional parameter to fit any residual background level. 
When we include an additional parameter for the residual background, its value is found 
to be consistent with zero. Our estimates of the slope (3 are consistent with typical values 
for clusters (e.g., Xu et al. 2001; Sanderson et al. 2003), even consistent with the trend 
of decreasing (3 with decreasing cluster temperature (e.g., Horner et al. 1999; Sanderson et 
al. 2003) for richnesses 0-3. The (3 values obtained for the two poorest richness bins have 
large uncertainties. We do, however, find larger core sizes than studies of individual clusters 
(e.g., Neumann & Arnaud 1999; Xu et al. 2001; Sanderson et al. 2003; Osmond & Ponman 
2004), ranging from roughly twice as large for richness bin to more than 5 times as large for 
richness bin 4, although the estimates for the poorest richness bins are very uncertain. This 
is not due to the RASS PSF, which is equivalent to a source size of ~ 20 kpc at a typical 
cluster redshift of z ~ 0.04. Part of the problem must be a smearing of the X-ray cores 
by errors in the estimates of the optical centroids. We matched our stacked clusters with 
the ROSAT extended source catalog and plotted the histogram of offsets between optical 
and X-ray centroids in Figure HI The median offset of 0.14 Mpc is consistent with the large 
core sizes obtained for our stacked images in richness bins and 1 compared with those 
obtained from individual clusters. The distribution of offsets is very similar to other studies 
(e.g., Adami et al. 1998; Lopes et al. 2006) when comparing optically-selected clusters and 
ROSAT clusters. Some of the large offsets are caused by merging clusters where the X- 
ray image has two cores and the optical centroid is set in between of the X-ray cores (e.g., 
Figure [lb), and in some other cases the large offset could be due to various other problems 
(e.g., Figure [lb). We also show one case where the optical centroid matches excellently with 
the X-ray centroid in Figure [T^l. Although we obtain larger cores in our stacked images, the 
average X-ray properties of the clusters should be little affected. 



2.2. Cluster Luminosities, Temperatures and Entropies 

In order to estimate the luminosities and temperatures we need to model the X-ray spec- 
tra of the clusters. We started by eliminating the 13% of clusters with Galactic absorption 
column densities higher than 10 21 cm -2 . We analyzed the spectra using XSPEC (Arnaud 1996) 
using the rmf file for the PSPCC detector available from HEASARC. Although the RASS 
contains some PSPCB observations, they are a small minority of the observations and the 
rmf files for the two detectors are very similar. Because the RASS was carried out by scan- 
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ning the sky, there is one additional complication arising from the corrections for vignetting 
and the off-axis telescope response. We can use two possible calibration files for our analyses. 
The first approach is to use the standard rsp file provided in the calibration package (which 
is equivalent to the on- axis arf +rmf files) combined with the vignetting-corrected exposure 
map. This approach will bias the spectral fits because the true vignetting correction is 
energy-dependent. The second approach is to use the off-axis arf file. This will lead to un- 
biased spectral fits, but the RASS images have already corrected the exposure times for the 
effects of vignetting, so we will be double-counting this correction when we use the off-axis 
arf file. Fortunately, this simply leads to a normalization offset in the 0.1-2.4 keV flux of 
a constant factor of 1.44 that we can determine from the flux differences between the two 
approaches. We adopt this latter approach with the correction to the fluxes. We generated 
the off-axis arf files using the pcarf software tool from the XSELECT and FT00LS packages. 
We extracted the spectra of the stacked clusters out to radii of 1.5, 1.5, 0.9, 0.65, and 0.4 
Mpc for richness bins 0-4, respectively. For each cluster we subtracted the background 
and normalized the data to the mean redshift of the stacked clusters (z = 0.0816, 0.0678, 
0.0539, 0.0387, and 0.0273 for richness bins 0-4) and then averaged the arf files based on 
the weighted number of photons in each cluster. Averaging of arf files has little effect on 
the results because they are all fairly similar due to the nature of the RASS observations. 

We fit the spectra with a thermal plasma model (Raymond & Smith 1977) modified by 
the mean Galactic absorption. We were unable to determine the metallicity from the data, 
so we assumed a metallicity of 1/3 solar (e.g., Baumgartner et al. 2005). The redshift was set 
to the mean redshift of the clusters in each richness bin. We did not worry about the small 
spread in the true K-corrections for the spectra across each bin because the low redshift of 
the sample (see Table 1) makes this a small effect. The spectra and their best fit models are 
shown in Figure [5] and the fitting results are presented in Table [2j We obtained acceptable 
fits for all richness bins, indicating that the emission is consistent with that from hot gas for 
all richness bins. As expected, the temperature decreases from rich to poor clusters, and the 
soft X-ray sensitivity of ROSAT (0.1-2.4 keV) allows us to measure the temperature of the 
poor clusters more accurately than for rich clusters. Given the temperatures, we can also 
estimate the bolometric X-ray luminosity. Table [2] presents the estimate for the luminosity 
within -R*666 found by using the f3 model fits from §2.1 to correct from the spectral extraction 
aperture to -R*666- These corrections were quite small, amounting to multiplicative factors 
of only 1.06, 0.92, 1.01, 0.99, and 1.03 for richness bins 0-4, respectively. In addition to 
the error estimates provided by the spectral fits, we also estimated the uncertainties by 
bootstrap re-sampling the data. For each richness bin we generated 500 random samples 
by drawing the same number of clusters from those in each richness bin with replacement. 
These bootstrap estimates of the uncertainties should represent systematic uncertainties 



better than the purely Poisson uncertainties of the standard fits. In general, the bootstrap 
uncertainties are slightly larger than the standard errors found from the spectral fits, but 
usually not by large margins. 

While the best fit Galactic Ah values are consistent with our A^h cut of 10 21 cm -2 , there 
is a worrisome correlation of Ah with richness. The estimate of the Galactic absorption is 
dominated by the lowest energy bins. We ran several fits with fixed A^h values in the range 
spanned by the results of Table [2J finding that the overall fits become worse for the richness 
bins best fit by a different value for A^h, but that the luminosity and temperature estimates 
change little compared to the statistical errors of the original estimates. We found that we 
could not analyze the high Galactic absorption clusters at all because of strong degeneracies 
between the estimates of the temperature and the absorption. Potentially, the origin of the 
problem could be the presence of an additional soft emission component in the low richness 
clusters, but we will not investigate this possibility here because it has no effect on our 
general results and is an area of considerable controversy (e.g., Bregman & Lloyd-Davies 



We also estimated the mean gas entropy, S = T/nJ , as a direct test of the thermody- 
namic state of the clusters and the effects of heat transport or energy injection (e.g., Ponman, 
Cannon, & Navarro 1999; Lloyd-Davies, Ponman, & Cannon 2000; Ponman et al. 2003). 
We estimated the electron number density n e from the surface brightness profiles derived in 
§2.11 and the temperature from the spectral fits. We assumed a constant temperature with 
radius. As we discussed in §2.11 we systematically obtain larger core radii than analyses of 
individual clusters, and an underestimate of the central surface brightness will lead to an 
overestimate of the central entropy. Therefore, we focus on the entropy on scales of 0.5r*666 
where we should be insensitive to this problem. 

We further divided the each of the first four original bins into two narrower bins. We 
followed the same procedure described in this section to obtain their temperatures and the 
bolometric X-ray luminosities (Table [2]). 



Finally, we estimated the mean cluster masses inside r* 66 6 assuming hydrostatic equi- 
librium, 



using our /3-models from §2.1[ fi = 0.61 (Arnaud 2005a), and assuming an isothermal temper- 
ature profile. This is not a trivial exercise given the stacked data. First, we cannot accurately 
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measure the mass profile in the inner regions of the clusters where we can accurately measure 
the temperature because of the smearing created by the position uncertainties discussed in 
§2.1. Second, we cannot accurately measure the temperature in the outer regions, or equiv- 
alently, the slope of the temperature profile, because we have insufficient counts in the outer 
regions. We expect the first issue is less severe since the mass obtained by this method 
mainly depends on the temperature and its gradient at a sufficiently large radius, which 
is less affected by the profile of the inner slope. The isothermal assumption will possibly 
lead to an over estimate of mass by ~15% (Sanderson et al. 2003). These uncertainties are 
smaller than those introduced from cluster temperature measurements for most cases. We 
estimated r 20 o and M 20 o from the mass profiles by calculating the mass and radius where 
the mass over density is 200 times the critical density. We also performed the same analysis 
to the narrower richness bins. Again, we expect the dominant uncertainties for masses are 
from the cluster temperatures. 

There are offsets between the M*666 and M 2 oo mass estimates that simply reflect the 
differences between r* 666 and r 20 o- The two radii are not simply proportional to each other, 
as illustrated in Fig. [6J The difficulty probably arises because r* e6G is computed from the 
fitted richness and a fixed three dimensional galaxy distribution because it is not possible to 
determine the galaxy distribution for the individual clusters with any accuracy (see Kochanek 
et al. 2003). To investigate this further we need to examine the optical properties of 
the stacked clusters, which will be the subject of a later study (Morgan et al. 2007 in 
preparation). Our estimate of r 2 oo from the stacked X-ray images is consistent with scaling 
relations from the measured temperatures, as illustrated by the values of r 180 derived from 
the temperature and the temperature-radius scaling relation from Evrard et al. (1996). 
Note, however, that the shifts in the masses due to the different radii are too small to affect 
the overall structure of the mass-temperature-richness correlations. 



Before discussing the results from measuring the mean luminosity, temperature or mass 
as a function of optical richness, it is necessary to discuss the important distinction between 
averaging at fixed richness and averaging at fixed mass (see Berlind & Weinberg 2002). For 
simplicity, we will assume a simple model for the halo mass function, 



with x ~ 1.88 and log(Mi/M Q ) ~ 14.6 and y ~ 0.73 closely reproducing a concordance 
cosmology mass function at z = 0. Since only higher mass halos form groups and clusters 



3. Averaging Cluster Properties at Fixed Optical Richness 
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of massive galaxies (our cluster finder is not designed to find galaxies with halos of lower 
luminosity satellites), we must truncate the halo mass function hm{M) at some point for it 
to model the cluster mass function n c (M). We model this by using n c (M) = hm{M) for 
M > M and then truncating it at lower masses 

n c (M) = n M (M ) exp (- ^ ^ Mo) ) for M < M (3) 

over a logarithmic mass range A. On average, a cluster of mass M contains 

observable galaxies, where Ni is the number of L > galaxies for a cluster of mass Mi 
and the ratio of Gamma functions c(z) corrects from the number of L > galaxies to the 
number of galaxies down to the luminosity L(z) corresponding to the flux limit of the galaxy 
survey at the cluster redshift z. The galaxy luminosity function is modeled as a Schechter 
function with slope a = —1.09 and break luminosity Mk* = —24.16 mag assumed by the 
cluster finder based on the Kochanek et al. (2003) K-band luminosity functions and with 
a limiting magnitude of K Hm = 13.25 mag. A particular cluster, however, will contain n 
galaxies drawn from the conditional probability function for a cluster of mass M having n 
galaxies, 

P(n|M) = ^exp(-(n>) (5) 

which we assume to be the Poisson distribution. In general, the cluster catalogs are roughly 
complete to the limit of n > 3 inside the radius of the filter (see White & Kochanek 2002; 
Kochanek et al. 2003), and this turns out to be the best fit estimate of the selection limit 
when we apply the Poisson model to the data. The richness estimate assigned to the cluster 
is not the observed number of galaxies n, but the estimated number of L > galaxies, = 
n/c(z). For our analytic models, we will further assume that there are perfect correlations 
T(M) and Lx{M) between the X-ray properties and the mass. The combination of these 
assumptions is essentially a simple example of a halo occupation distribution (HOD) model 
(e.g. Yang et al. 2005, Zheng et al. 2005) for the clusters. 

When we stack the X-ray data for cluster of fixed observed richness, iV*, we compute 
mean values of the form, given here for the mean mass, 

(M(N„z))= sriMnc(M)P(NAz)m ■ W 

If we set 7 = 1, so that the number of galaxies is proportional to the mass, and use a sharp 
cutoff A = in the cluster mass function, then the integrals can be done analytically to find 
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l + N lC (z)T[l .. 

where T[a, 6] is an incomplete Gamma function. If we had averaged at fixed mass, we would 
simply find that the mean mass is M(N*) = M^N^/Ni), and it is interesting to examine 
how the average at fixed richness differs from that at fixed mass. If we consider rich clusters 
with many galaxies (M 3> M , N*c(z) 3> 1) then we find that 

1 i -i 



(M(N*,z)) 
M(N m ) 



1 + 



1 — x 
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N lC (z) 



N*c(z)_ 

The Poisson fluctuations bias the average mass at fixed richness to be low, with a fractional 
amplitude of order the number of detectable galaxies in the cluster. This is essentially a 
Malmquist bias due to the steep mass function - more low mass clusters Poisson fluctuate 
up to the fixed richness than high mass clusters fluctuate down to it. The more interesting 
limit is that of very low richness clusters. Very low richness clusters will always be close to 
the cluster detection limit, so let us set N*c(z) = 3 and take the limit of very low richness 
{N* -> 0) to find that 

(M(JV„z)>-Mo. (9) 

The only way to get an apparently very low richness cluster is as a Poisson fluctuation of 
a cluster with M ~ Mo, so the mean mass converges to Mo. Thus, compared to averaging 
properties of clusters at fixed mass, clusters averaged at fixed luminosity should be biased 
lower in mass for rich clusters and higher in mass for poor clusters even before considering 
any selection effects. As we shall see, this simple model explains all the results that follow. 



4. Sample Correlations 

We summarize the general correlations in Figs. I71TT21 We start with the correlations 
familiar from studies of X-ray clusters. Figure [7] shows the correlation between X-ray lumi- 

Table 1. Average Cluster Surface Brightness Profiles 
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Fig. 4. — Histogram of offsets between optical and X-ray centroids. 

Table 2. Spectral Fitting Results 
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Note. — The ^666, R*666, and z values listed here are the average values of each bin. The 
uncertainties of Lx and T are the bootstrap estimates. 
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Fig. 5. — The stacked ROSAT spectra of clusters in richness bins 0-4 and their best fit 
thermal plasma models. 
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Fig. 6. — The relationship between cluster scaling radius and optical richness N*^. The 
filled circles are r*666 obtained from optical algorithm, the squares are rigo converted from 
mean temperature (Evrard et al. 1996), and the X symbols are r 2 oo calculated from our 
mass profile for stacked clusters. The radius r* 666 is algebraically related to 7V* 666 because 
of the fixed model for the spatial distribution of the galaxies (see Kochanek et al. 2003). 
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Fig. 7. — The relationship of the bolometric X-ray luminosity to the temperature. The open 
and filled circles are the results from our stacking the clusters as a function of richness, and 
the solid line is the best fit Poisson model (see §3) to these results. The filled circles are for 
the five standard richness bins, while the open circles are the results found for the narrower 
bins obtained by dividing the first four richness bins in half. The open triangles are the 
results found for the middle redshift bin (see Figs. [131 fl4j) when we divide the clusters in 
redshift sub-bins. The error bars on the circles are the bootstrap estimates of the errors. 
The squares show the results for individual 2MASS clusters drawn from the literature. The 
dashed, dotted, dash-dotted, and dash-dot-dot-dotted lines are from the Lx-Tx relations 
from Wu et al. (1999), Helsdon & Ponman (2000), and Xue & Wu (2000), and Rosati et al. 
(2002). 
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Fig. 8. — The relationship between mass M and temperature T, using the subdivided bins 
for the first four richness bins. The filled circles are the M 20 o values for the stacked clusters, 
and the solid line is our best fit Poisson model (see §3). The open circles are the 
values, and the error-bars of the open circles are similar to those for the filled circles. The 
dashed, dotted, dash-dotted, and dash-dot-dot-dotted lines are the relations from Arnaud et 
al. (2005b), Xu et al. (2001), Sanderson et al. (2003), and Neumann & Arnaud (1999). 
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nosity and temperature, Figure [8] show the relation between cluster mass and temperature, 
and Figure [9] shows the relation between entropy and temperature. In each case we show 
the results for the stacked clusters, the results of our global fit to the cluster properties (the 
"Poisson model", except Fig. [9]) and several similar scaling relations from the literature. 
In some cases we also show the measured properties of individual 2MASS clusters drawn 
from the literature (Ponman et al. 1996; Ebeling et al. 1996, 1998; De Grandi et al. 1999; 
Bohringer et al. 2000; Mahdavi et al. 2000; Cruddace et al. 2002; Reiprich & Bohringer 
2002) and corrected to our assumed cosmology and to a bolometric X-ray luminosity. 

The Lx — T relation of the stacked clusters closely resembles that derived from obser- 
vations of individual clusters (Fig. [7]). There is a clear break in the slope between high and 
low temperature clusters. If we fit the results with a broken power law (see Table [3]) we find 
slopes for rich (2.7 ± 0.7) and poor (5.8 ± 1.7) clusters consistent with studies of individual 
clusters (2.5 — 3.0 for rich clusters, Wu et al. 1999; Rosati et al. 2002 and 4 — 7 for poor 
clusters, Helsdon & Ponman 2000; Xue & Wu 2000). The correlation we observe does not 
depend on the redshift when we subdivide our cluster catalogs into redshift bins. Similarly, 
the relationship between mass and temperature, shown in Fig. E]for both M 20 o and M„, 666 , 
is well-fit by a single power law with M* 666 oc T L79±017 and M 200 oc T l b9±Q17 (see Table [3]) 
The results are generally consistent with relations derived from samples of clusters (e.g. Xu 
et al. 2001; Neumann & Arnaud 1999; Sanderson et al. 2003; Arnaud et al. 2005b) as well 
as with theoretical models (Evrard et al. 1996). The offsets between M* 666 and M 20 o created 
by the differences between r* 666 and r 2 oo discussed in §2.31 have little effect on the results. 
Finally, our estimated entropies are inconsistent with a simple S (x T scaling relation, as 
shown in Fig. [9j They are, however, consistent with the shallower S oc T a65 relation of Pon- 
man et al. (2003) or a break in the entropy for halos with the temperatures of order 1 keV 
where we observe a break in the Lx — T relation. In summary, the X-ray properties of the 
stacked clusters are essentially indistinguishable from those found from studies of samples of 
individual clusters. 

Figs. rTTJHT2l show the correlations of the X-ray properties and the mass with the richness 
iV* 666 . At a first glance, these correlations may seem to be inconsistent with the X-ray and 
mass correlations we just discussed if the richness is a simple power-law of the mass (Eqn. 
4). We carried out extensive tests to verify these correlations such as subdividing the sample 
by redshift, and using the soft-band rather than the bolometric luminosity. Using the soft- 
band luminosity avoids the need to accurately measure the temperature, and we expect any 
systematic problems in the richness estimate to be strongly redshift dependent. In fact, 
these correlations are the consequence of averaging at fixed optical richness in the presence 
of Poisson fluctuations. We can illustrate this by fitting all the correlations using the simple 
Poisson model of §3. We assume a power law relation for T(M) and iY*666(^0; a broken 
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power-law relation for Lx{T), and a cluster mass function cutoff at mass scale Mq over a 
mass range set by A. Most of these parameters are simply the relations we need to fit 
the individual relations, with two parameters added to truncate the distribution of clusters 
at low mass. The parameters of these relations are summarized in Table [31 and we find 
a sensible mass cutoff of log M /M & ~ 13.46 ± 0.08 that needs to be slightly softened by 
A ~ 0.18 ± 0.03 in order to reproduce the gradual decline in the X-ray temperatures and 
luminosities toward very low richness. The mass-richness relation is 

logiV* 666 = (1-10 ± 0.04) + (0.87 ± 0.05) log(M/Mi). (10) 

Our earlier result in Kochanek et al. (2003) was that logiV* 666 = (1.44 ± 0.17) + (1.10 ± 
0.09) log(M/lO 15 M ) so the normalizations are consistent while the estimates of the slope 
differ by about 2a. The present, shallower slope is very similar to that of Lin, Mohr & 
Stanford (2004) or Popesso et al. (2005). Although the Poisson model is our preferred 
model, we also list the individual fits between stacked cluster properties in Table [3] for 
comparison with other studies. 

The last point we consider is whether the matched filter algorithm leads to cluster 
catalogs with redshift-dependent biases. We test for the biases by dividing each richness 
bin into three redshift bins with the boundaries chosen so that each bin produced a stacked 
spectrum with a similar signal-to-noise ratio. We also created two overlapping (i.e. not 
independent) redshift bins centered on the edges of the first sets of bins to better illustrate 
any variations with redshift. For each of these bins we extracted the spectra as described in 
§4, normalized each spectrum to the mean redshift of the bin, fit the spectra to obtain the 
luminosity and temperature and estimated the uncertainties using bootstrap re-sampling. 
The results are shown in Figures [12] and dU either as the raw results or normalized to 
remove the variations in the mean richness between the redshift bins. The error-bars for 
temperatures are larger than those for the luminosities and we only show results from the 
first three richness bins. 

The qualitative result is that there are few signs of redshift dependent biases beyond 
the expectations of the Poisson model. The most massive clusters (richnesses and 1) show 
a slow decline in the mean luminosity and temperature at fixed richness because of the 
increase in the Malmquist biases with the diminishing numbers of detectable galaxies. The 
intermediate richness class 1 is far enough from both the break in the mass function and the 
lower mass limit to show little variation with redshift. The two lowest richness classes show 
a rapidly rising luminosity with redshift. In all cases, the trends are well matched by the 
simple Poisson model - we see no evidence for additional systematic biases. 
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Table 3. Fitting Results for L x -T x -N* m6 -M . 



Relation Reference Model loglo k logXg m n xl(dof) 



Individual Fits a 



L x -T 


this paper 
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43.11 ±0.23 






10+0.59 
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this paper 
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this paper 
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this paper 
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this paper 
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26 ±0.32 
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K03 
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-0.25 ± 0.14 
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09 ±0.17 













this paper 
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0.56 ±0.13 






0.13 ± 0.13 


1.35 ±0.42 


4.0 ± 1.2 


0.4(5) 


T - N, 6m 


this paper 


SPL 


-0.01 ± 0.03 





36 ± 0.06 









1.4(7) 




this paper 


BPL 


0.09 ±0.04 






0.51 ± 0.27 


0.74 ±0.17 


0.20 ±0.07 


0.4(5) 


M.666 - T 


this paper 


SPL 


13.64 ±0.05 
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79 ±0.17 









0.1(7) 


M 200 - T 


this paper 


SPL 


13.56 ±0.05 
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59 ±0.17 









1.0(7) 


M*666 - iV*666 


this paper 


SPL 


13.64 ±0.06 





74 ±0.10 









1.1(7) 




this paper 


BPL 


13.79 ±0.09 






0.48 ± 0.30 


1.27 ±0.28 


0.36 ±0.17 


0.1(5) 


M200 - iV*666 


this paper 


SPL 


13.58 ±0.06 





56 ±0.10 









2.9(7) 




this paper 


BPL 


13.53 ± 0.09 






0.49 ± 0.18 


1.44 ±0.27 


-0.05 ± 0.17 


0.1(5) 



Poisson Model Fits c 


Lx-T 
M200 - T 


this 
this 


paper 
paper 


BPL 
SPL 


42.55 ±0.05 ■•■ 0.00 ±0.05 3.03 ±0.17 
13.58 ±0.05 1.65 ±0.12 


10.3 ±2.4 



Note. — The single power law (SPL) is denned by Y = Y (X/X ) k while the broken power law (BPL) is Y = Y (X/X ) 
for X > X and Y = Y (X/X ) n for X < X . 

a Fits to scaling relations between two cluster parameters only. 

b Kochanek ct al. (2003). 

c The fitting results for the global Poisson model (see §3). 
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5. Summary 

Optically-selected cluster catalogs have always been suspect because of concerns about 
false detections and redshift-dependent biases created by chance alignments of galaxies. 
Current attempts to understand the biases of optically-selected catalogs have focused either 
on testing the algorithms in mock galaxy catalogs (Kochanek et al. 2003; Miller et al. 2005) 
or by using weak lensing measurements (Sheldon et al. 2001). Unfortunately, there has 
never been a complete X-ray survey of a large optically-selected cluster catalog to provide 
an independent test of these concerns. In this study we have used a stacking analysis of the 
ROSAT All-Sky Survey data to determine the average X-ray properties of a sample of 4333 
clusters found using the matched filter algorithm of Kochanek et al. (2003) applied to the 
2MASS galaxy survey with a magnitude limit of K< 13.25 mag. 

After dividing the clusters into bins of optical richness, we could successfully measure 
the surface brightness profiles, bolometric X-ray luminosities, X-ray temperatures, masses 
and gas entropies for the averaged clusters, finding results that are generally consistent with 
those from analyzing individual clusters. The one exception is that we tend to find larger core 
radii for /9-model fits to the surface brightness presumably because of additional smoothing 
created by errors in the optical positions. The stacked clusters cleanly follow the correlations 
observed for samples of individual clusters between X-ray luminosity, temperature and mass. 
However, in order to interpret the correlations between the optical richness and the X-ray 
luminosity, temperature and mass, it is necessary to model the effects of Poisson fluctuations 
of the number of galaxies in a cluster because in the presence of these fluctuations averaging 
at fixed richness is very different from averaging at fixed mass. These effects matter for 
high richness clusters observed at redshifts where they will contain few detectable galaxies, 
and for all low richness clusters. A very simple Poisson model for the effects of these fluc- 
tuations naturally reproduces our results. We note that Stanek et al. (2006) have recently 
demonstrated similar effects arising from the scatter in X-ray luminosity at fixed cluster 
mass. 

We also examined the problem of redshift-dependent biases in optical catalogs by ex- 
amining the X-ray properties of clusters of fixed richness as a function of redshift. The low 
redshifts of the 2MASS clusters (z < 0.1) means that we need worry little about genuine 
cosmological evolution. Our statistical model predicts that high richness clusters will show a 
slowly declining X-ray luminosity, while low richness clusters will show a rising X-ray lumi- 
nosity, and this agrees with our measurements. Essentially, Malmquist biases created by the 
much more abundant low-mass clusters become steadily more important for the high mass 
clusters as the numbers of detectable member galaxies decline, while the reverse occurs for 
the low-mass clusters. In general, however, samples restricted to clusters containing at least 3 
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detectable galaxies (already a floor required to minimize the abundance of false positives) and 
to the higher mass clusters rather than groups (T > 2 keV, M > 10 14 M Q , L x £ 10 43 ergs/sec) 
have corrections due to the effects of Poisson fluctuations that are relatively simple to model 
and depend little on the model for the cutoff between groups and galaxies. Models for 
the lower mass groups are more challenging, particularly if variations with redshift must be 
included. We see no evidence for large systematic errors created by chance superpositions. 

Despite the shallowness of the RASS, our approach (see also Bartelmann & White 
2003) can be applied to analyses of cluster samples with much higher mean redshifts than 
the 2MASS sample, where even the richest clusters are tracked only to z <0.1. Modulo 
redshift scalings for K-corrections and between cosmological distances, the signal-to-noise 
ratio of a stacking analysis scales as the square root of the product of the survey area 
and depth. Thus, an analysis of clusters in 10000 deg 2 of the SDSS survey (25% of the 
area of 2MASS), which can detect rich clusters with z <0.5 rather than z <0.1 (5 times 
the depth), should have a signal-to-noise ratio comparable to our present analysis for the 
measurement of the mean X-ray properties of the clusters. Analyses of rich clusters over 
1000 deg 2 would have a signal-to-noise ratio comparable to our analysis of clusters with 
~ 1/3 the richness. This assumes that the analysis remains limited by statistical errors 
rather than being dominated by systematic problems such as contamination by faint point 
sources or difficulty in controlling the background. 

This publication makes use of data products from the Two Micron All Sky Survey, which 
is a joint project of the University of Massachusetts and the Infrared Processing and Analysis 
Center/ California Institute of Technology, funded by the National Aeronautics and Space 
Administration and the National Science Foundation. This work makes use of the RASS 
data achieved by the HEASARC, a service of the Exploration of the Universe Division at 
NASA/GSFC and the High Energy Astrophysics Division of the Smithsonian Astrophysical 
Observatory. We thank J. Huchra for his assistance in generating the new group catalogs. 
We thank M. F. Corcoran, S. Snowden, D. Grupe, E. Rykoff and T. Mckay for helpful 
discussion on the stacking of RASS data, and M. W. Bautz, G. Evrard, D. Weinberg, M. 
White, Z. Zheng for their comments on the results. 
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Fig. 9. — The relationship between temperature and entropy at 0.5 r^ 666 . The slope of the 
solid line is the S oc T 65 relation from Ponman et al. (2003) and the dotted line is the 
S oc T relation expected from simple scaling relation. 
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Fig. 10. — The relationship of the bolometric X-ray luminosity to the optical richness iV* 666 . 
The filled circles with error-bars are our present results, using the subdivided bins for the first 
four richness bins. The filled triangles are data from the middle redshift bin (see Figs. [TBI 
UM . The solid line shows the best fit Poisson model to the data, and the dashed line shows 
the true relationship between L x and iV* 666 we would observe in the absence of Poisson 
fluctuations. The squares are individual 2MASS clusters drawn from the literature. 
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Fig. 11. — The relationship between X-ray temperature and optical richness iV*666- The 
filled circles with error-bars are our present results, using the subdivided bins for the first 
four richness bins. The filled triangles are for the clusters in the middle redshift bin (see 
Figs. [13l [14|). The solid line shows the best fit Poisson model to the data, and the dashed 
line shows the true relationship between T and iV*666 we would observe in the absence of 
Poisson fluctuations. The squares are individual 2MASS clusters drawn from the literature. 



0.1 1.0 10.0 

N*666 



Fig. 12. — The relationship between mass M 20 o and optical richness iV* 666 . The filled and 
open circles are the M 20 o and M 666 values, and the error-bars for the open circles are similar 
to those for the close circles. The solid line shows the best fit Poisson model to the data 
for M 200 , and the dashed line shows the true relationship between M 2 oo and 7V* 666 we would 
observe in the absence of Poisson fluctuations. 
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Fig. 13. — X-ray luminosity versus redshift for stacked clusters in richness bins (top) 
to 4 (bottom). The upper panel shows the raw results, the lower panel corrects for the 
difference between the mean richness of all clusters in the richness bin and the mean richness 
in the individual redshift bins. The even and odd points in the redshift sequences are not 
independent measurements. The lines in both panels are the best fit Poisson models for each 
richness bin. 
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Fig. 14. — Temperature versus redshift for stacked clusters in richness bins (top), 1, 
and 2 (bottom). The upper panel shows the raw results, the lower panel corrects for the 
difference between the mean richness of all clusters in the richness bin and the mean richness 
in the individual redshift bins. The even and odd points in the redshift sequences are not 
independent measurements. The lines in both panels are the best fit Poisson models for each 
richness bin. 



